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Abstract 

In a classical problem in scheduling, one has n unit size jobs with a precedence or¬ 
der and the goal is to find a schedule of those jobs on m identical machines as to 
minimize the makespan. It is one of the remaining four open problems from the 
hook of Garey & Johnson whether or not this problem is NP-hard for m-3. 

We prove that for any fixed e and m, a Sherali-Adams / Lasserre lift of the time- 
index LP with a slightly super poly-logarithmic number of r - (log(n))®^*°®*°®”^ rounds 
provides a (1 -i- c)-approximation. This implies an algorithm that yields a (1 -i- £)- 
approximation in time . The previously best approximation algorithms guar¬ 
antee a 2 - approximation in polynomial time for m > 4 and | for m = 3. Our 
algorithm is based on a recursive scheduling approach where in each step we re¬ 
duce the correlation in form of long chains. Our method adds to the rather short list 
of examples where hierarchies are actually useful to obtain better approximation 
algorithms. 


1 Introduction 

One of the landmarks in the theory of scheduling is the paper of Graham IGra66l from 
1966, dealing with the following problem: suppose we have a set J of n jobs, each one 
with a running time pj along with m identical parallel machines that we can use to pro¬ 
cess the jobs. Moreover, the input contains a precedence order on the jobs; we write 
j < j' if job j has to be completed before job j' can be started. The goal is to sched¬ 
ule the jobs in a non-preemptive fashion so that the makespan is minimized. Here, the 
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makespan gives the time that the last joh is finished. In the 3-fleld notatioi^ this prob¬ 
lem is abbreviated as P \ prec | Cmax- Graham showed that the following list schedule 
gives a (2 - -approximation on the makespan; compute an arbitrary topological or¬ 
dering of the jobs and whenever a machine becomes idle, select the first available job 
from the list. It had been known since the late 70’s that it is NP-hard to approximate 
the problem better than within a factor of 4/3 due to Lenstra and Rinnooy Kan ILK78I 
and Schuurman and Woeginger ISW99I prominently placed the quest for any improve¬ 
ment on their well known list of 10 open problems in scheduling. Finally in 2010, Svens- 
son ISvelOl showed that assuming a variant of the unique games conjecture IBK09I , there 
is no (2 - £)-approximation algorithm for P \ prec, pj = I \ Cmax- However, for unit size 
jobs, Lam and Sethi ILS77I analyzed an algorithm of Coffman and Graham and showed 
that it provides a slighly better guarantee of 2 - for P | prec, Pj = l\ Cmax- Later, Gangal 
and Ranade IGR08I gave an algorithm with a 2 - guarantee for m > 4. 

In a typical scheduling application, the number of jobs might be huge compared 
to the number of machines, which does justify to ask for the complexity status of such 
problems if the number m of machines is a constant. Even under the additional re¬ 
striction of unit size jobs, no better approximation result is know. In fact, it is one of 
the remaining four open problems from the book of Garey and Johnson |GJ79| whether 
P3 I prec, Pj = l\ Cmax is even NP-hard. Also Schuurman and Woeginger ISW99I list un¬ 
der “Open Problem 1” the question whether there is a PTAS for this problem (recall that 
for m = 2, the result of ILS77I gives an optimum schedule). 

To understand where the lack of progress is coming from, one has to go back to the 
list scheduling algorithm of Graham. If we schedule the jobs in a greedy manner, then 
one can argue that there is always a chain of jobs j\< 72 < ... < jk so that at any point in 
time either all m machines were fully busy or a job from that chain was processed. Since 
both quantities, the load Lje/ Pj and the length of any chain are lower bounds on any 
schedule, we can conclude that the schedule has length at most 2 - OPT. One can shave 
off a factor of even for general running times, by observing that the processing times 
of the jobs in the longest chain do not need to be again counted in the load bound. Also 
the papers ILS77I and IGR08I effectively rely on those two lower bounds. In fact, lGha95l 
showed that a large class of algorithms including the ones of IGra66l[GR08l cannot beat 
a bound of 2 - moreover Graham’s algorithm is indeed not better than a [2- 
approximation for unit size jobs, see IGR08I . 

Of course, one always has the option to study the strength of linear programs for an 
optimization problem. The natural one for Pm | prec, pj = I \ Cmax is certainly the fol¬ 
lowing time-indexed LP: For a parameter T that denotes the length of the time horizon, 

hn the 3-field notation, the first field specifies the available processors, the 2nd field the jobs and the 
last field the objective function. In our case, Pm means that we have m identical machines; pj = l,prec 
indicates that the jobs have unit length and precedence constraints and the last field Cmax specifies that the 
objective function is to minimize the maximum completion time. 
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we define a set KiT) as the set of fractional solutions to: 


T 


t=i 

=. 

1 yjej 

jo 

< 

m VtejT] 

E 

> 

X xjf yi<jyte[T] 

t'<t 


t'<f+l 


= 

^Xj^t intervals 

tel 

0 < Xjt 

< 

1 yjejytelT] 


Here xj^t is a decision variable that is supposed to tell whether job J £ / is scheduled in 
time slot t e The constraints guarantee that in an integral solution each job is 

assigned to one time slot; no time slot receives more than m jobs and for a pair of jobs i < 
j, job i has to be scheduled before j. Note that we have introduced redundant looking 
variables xjj which for each interval I = {a,...,b} with I < a< b < T denote whether job 
j will be scheduled somewhere within that interval I. Those variables obviously do not 
change the feasibility of the LP, but they will be very helpful later for notational purposes. 

Unsurprisingly, this LP has a constant integrality gap as one can see from the follow¬ 
ing construction: take k blocks Ji,...,JicOi\Ji\ = m + l jobs each and define the prece¬ 
dence order so that all the jobs in /, have to be finished before any job in /,+i can be 
started. Any integral schedule needs two time units per block, hence OPT = 2k. On the 
other hand, the LP solution can schedule the m -i-1 jobs of each block “in parallel”, each 
at a rate of and finish the schedule after k • time units in which each machine 
has always been fully busy. This results in an integrality gap of at least 2 - 

It has been long known, that in principle one can take the linear program for any op¬ 
timization problem and strengthen it automatically by applying an LP or SDP hierarchy 
lift. We will provide formal definitions later, but basically these operators ensure that for 
any set of at most r variables, the LP solution indeed lies the convex hull of integral com¬ 
binations. Here, r is the number of levels or rounds and one typically needs time to 
solve an r-level hierarchy. 

Some known approximation results have been reinterpreted in hind side in this frame¬ 
work, for example a constant number of Lasserre rounds applied to a basic LP suffice 
for the Goemans-Williamson algorithm for MaxCut IGW95I and also a constant num¬ 
ber of Lasserre rounds imply the triangle inequalities in the 0(^logn)-approximation 
algorithm by Arora, Rao and Vazirani IARV09I . Moreover, the subspace enumeration 
component in the subexponential time algorithm of Arora, Barak and Steurer lABSlOl 
for Unique Games could be replaced with a Lasserre SDP. However, there are relatively 
few results where hierarchies have been genuinely useful (at least fewer than researchers 
have hoped for). For example Ghlamtac lGhl07l used SDP hierarchies to find better 
colorings in 3-colorable graphs and Raghavendra and Tan IRT12I apply them to ob¬ 
tain approximation algorithms for GSPs with cardinality constraints. An application to 
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color hypergraphs can he found in ICS08I . Hierarchies also turned out to he the right 
approach for Sparsest Cut in hounded tree width graphs, see the paper hy Chlamtac, 
Krauthgamer and Raghavendra ICKRIOI and the 2-approximation hy Gupta, Talwar and 
Witmer 1GTW13I . 

1.1 Our contribution 

Our main result is that a Lasserre lift with rounds closes the inte¬ 

grality gap of the above LP to at most 1 -i- £. This implies: 

Theorem 1. For the problem Pm \ prec, pj = l\ Cmax one can compute a (l-tc) -approximate 
solution in time where r := 

This gives a partial answer to one of the question under “Open Problem 1” in ISW99I 
which asked whether there is a PTAS for this problem, in a Dagstuhl workshop, Math- 
ieu |DaglO| asked the more specific question whether the Sherali-Adams hierarchy gives 
a (1 -t- £)-approximation after c{e, m) rounds. We also make progress on the question 
from the book of Garey and Johnson |GJ79| by improving the |-polynomial time ap¬ 
proximation for m = 3 ILS77I to a 1 -H a in slightly more than quasi-polynomial time, 
in particular, this implies that Pm \ prec, py = 1 | Cmax is not APX-hard, assuming that 
NP g 

We should point out that the same result could be proven with other hierarchies such 
as Sherali-Adams. We choose to work with the Lasserre hierarchy as it allows a cleaner 
mathematical definition, in our opinion. 

2 The Lasserre hierarchy 

in this section, we want to discuss the Lasserre hierarchy which was introduced in ILasOlal 
iLasOlbl . For the moment, let iC = {x e D?” | Ax > b} be some linear relaxation of an arbi¬ 
trary binary optimization problem, in any words, X is a relaxation of the integral hull 
Kj := convex n {0,1}”). Let us make the observation that for a vector xe Kj there is a dis¬ 
tribution X over X n {0,1}” so that x is the vector of marginal probabilities, that means 
Pr[X, = 1] = X,. if we only have xe K, then in general, there is no such probability dis¬ 
tribution. We can also make the observation that the vector x contains no information 
on the probability of joint events such as Pr[X; = Xj = 1]. The idea behind LP or SDP 
hierarchies in general, is to add additional variables to the system that contain this extra 
information of probabilities for joint events. 

In case of Lasserre’s hierarchy, we consider a vector y e which is a 2” - dimensional 

vector, indexed by subsets of variables. The motivation is that for any subset lQ{l,...,n} 
of variable indices, we would want the entry y/ to be the probability y/ = PrjAiE/lX, = 1)]. 
in particular this means that y^} = x, and y^ = 1. Note that any probability distribution 
over {0,1}” is determined by 2” -1 many values and one can use the Inclusion-Exclusion 
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Formula to derive all probabilities from just knowing tbe y/’s. Moreover, one can write 
down explicitly tbe constraints that ensure that the vector y encodes a probability dis¬ 
tribution. However, as y is 2"-dimensional and we aim at (quasi-)polynomial time algo¬ 
rithms, we will only introduce constraints that involve entries y/ for |7| < 2r -i-1 and r is 
the number of rounds or the level of our solution. This way we can still solve the system 
in time n^^''\ 

For a symmetric matrix M, we will write M > 0 if M is positive semideflnite, that 
means if all its Eigenvalues are non-negative. 

Definition. Let iC = {x e IR" | Ax > b} with A e and b e R'”. We define the r-th level 
of the Lasserre hierarchy LASr (iO as the set of vectors y e R^'"' that satisfy 



>0 V£e[m]; y^ 


The matrix Mr iy) is called the moment matrix of y and the second type M^iy) is called 
moment matrix of slacks. Furthermore, let := Kyp}, • • •, y{n]) I y ^ Las,- [K)} be 

the projection on the original variables. 

We want to emphasize that the r-th level of Lasserre cares only for entries y/ with 
|/| < 2 r -t 1 - all other entries are ignored (if the reader prefers a proper dehnition, he 
might impose y/ = 0 for |/| > 2r -i-1). The set of positive semidehnite matrices forms a 
non-polyhedral convex cone, thus also LASriK) is a convex set. The separation problem 
for LASr(7f) boils down to computing a negative Eigenvector of the involved matrices, 
which can he done in polynomial time. As a consequence, if m denotes the number of 
linear constraints, one can optimize over LASr(7f) in time up to numerical 

errors that can be neglected. 

Intuitively speaking, the constraint Mr iy) > 0 takes care that the variables are consis¬ 
tent (e.g. it guarantees that y{i, 2 } e [ypi + y{ 2 } “ l>inin{y{i},y{ 2 }}]). Moreover, the psdness 
of the £ih slack matrices (y) guarantees that y satishes the T’-th linear constraint. 

Let us begin with discussing some of the basic properties, especially that Las r (70 is 
indeed a relaxation of TTn { 0 , 1 }" and that any set of at most r variables looks like the 
convex combination of integral solutions. For a proof, we refer to the survey of Lau¬ 
rent lLau03l or the lecture notes of the 2nd author lRotl3l . 

Lemma 2. Let TT = {x e R" | Ax > b} and y e Las r (K) for r > 0. Then the following holds: 


a) if n {0,1 }” = Las^''°^(70 c ... c LAsf “^(70 £ Las{^''°^(70 £ K. 


b) One has 0 < y/ < y/ < 1 for all fQj with 0 < | /| < |/| < r. 


c) Suppose r > 1 and ie[n] is a variable with 0 < y, < 1. If we define 


„(i) yiu{i] 

Zt 


and Zj := 


( 0 ) yi^ yiu{i] 

^ ■” i-y< 
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then we have y = y; • + (1 - yO ■ with z®, e Las^-i (iO where z;° = 0 , and 


z™ = l. 


If we have vector y £ LASr(i^) and a variable I with 0 < y, < 1, then conditioning on 
Xi = 1 means to replace the vector y with the vector z^^' e Las^-i (i^) with z[^* = 1 . On the 
other hand, if we are conditioning on xi = 0 , then it means to replace yby z^^' e LASr-iCiO 
with zf^ = 0. The correctness for this procedure follows right from property c). 


2.1 The Lasserre Lift for the scheduling LP 

Now, let us go back to the scheduling LP and discuss the properties of the Lasserre lift, ap¬ 
plied to iSTiT) fromEq. {D. Foravectory e LASr(i^(T)) andajob 7 ,wedefinesupp( 7 ,y) := 
{te [T] I yy,f >0} as the support of the job. Here we should mention that for the sake of a 
simpler notation we address the singleton variables y{(j,t)] as y^f whenever convenient. 

Lemma 3. Let ye LASr[K[T)). Then the following holds: 

a) Ifsupp[j,y) Jq} and suppij', y) e {Jq. T} then it is impossible that j' < j. 

b) Suppose that y' is derived from y by any sequence ofk<r conditionings. Then 
y' £ LASr-kiKiT)) and suppij,/) £ suppij, y) for all jobs j £ /. 

Proof. Claim a] already follows from the fact that y represents a feasible fractional so¬ 
lution to KiT). For claim b), by induction it suffices to consider the case where y' is 
derived by a single conditioning. Let’s say we conditioned on = 1 and suppose that 
ynji.ti)} = 0- Then from the definition in Lemma[31c we know that 


TfOi.fi)} - 

using Lemma[2lb in the inequality. 


^ yiQi.ti)} 


yyjo.k)} 


yiih.m 


□ 


3 An overview 

In this section, we will give an overview over the different steps in our algorithm; the 
detailed implementation of some of the steps will be given in the following Sections[ll[5] 
andm For a given time horizon T, a feasible schedule is an assignment a : / — {1,..., T} 
with |c7“^ (f) I < m for all te[T] and for all j, j' £ / one has j < j' ^ aij) < aij'). 

Formally, our main technical theorem is as follows; 

Theorem 4. For any Lasserre solution y e LASriKiT)) with r := 

one can find a feasible schedule a : / ^ N of the jobs in time so thatmaxjsjcrij) < 

(i + £)-r. 
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To obtain a (l+e) -approximation, we can find the minimum value of T so that LASriK{T)) 
0 with binary search and then compute a vector y e LASriK{T)). In particular, by virtue 
of being a relaxation, that value of T will satisfy T < OPT, where OPT is the makespan 
of the optimum schedule. For the sake of a simpler notation, we will assume that T is 
a power of 2 — if < T <2^ for some integer z, then one can add m • (2^ - T) many 
dummy jobs that all depend on each original job so that the algorithm will schedule the 
dummy jobs at the very end. Moreover we will assume that m < log 2 (n) as otherwise 
the bound is meaningless. 

The main routine of our algorithm will schedule jobs only within the time horizon 
T of the Lasserre solution, but we will allow it to discard )obs. Formally this means, we 
will hnd an assignment a : J\ /discarded ^ [/’] that will not have assigned slots to jobs in 
/discarded- Such an assignment will still be called “feasible” if apart from the load bound, 
the condition j < j' ^ a[j) < a{j') is satisfied for all j,j' e J\ /discarded- In particular 
dependencies with discarded jobs play no role in this definition. 

The reason for this definition is that one can easily insert the discarded jobs at the 
very end of the algorithm: 

Lemma 5. Any feasible schedule a : J\ J discarded ^ {1,..., T} can be modified in polyno¬ 
mial time to a feasible schedule a* : / —{!,..., T-t l/discardedll (which also includes the 
previously discarded jobs). 

Proof. Select any job j* e /discarded- Since a is a valid schedule which respects all prece¬ 
dence constraints in / \ /discarded^ there must be a time t* so that all jobs j < j* have 
a(j) < t* and all jobs j with j* < j have a{j) > t*. Then we insert an extra time unit after 
time f *; in this extra time slot, we only process j *. Then we continue the procedure with 
inserting the next j oh from /discarded \ 1/ * 1 - □ 

Now, let us introduce some notation: We can imagine the precedence order as a 
directed transitive graph G = {J,E) with the nodes as jobs and edges (/,/') e E o j < j'. 

In that view, let d'*' ( j) := {]' e J \ j < j'} be the jobs depending on j and let 5~ ( j) := {j' e 
J \ j' < j] be the jobs on which j depends. Note that 6'^ (j) and S~ ( j) are always distinct. 

We abbreviate 5 ( j) := d'*' [j) u d“ [j) as the jobs that have any dependency with j. Finally, 
for a subset of jobs /' e /, let A(/) := max{|d(/) n/| -i-11 / e /} be the maximum degree 
of a node in the subgraph induced by /, counting also the node himself. 

We partition the time horizon [T] into a balanced binary family Z of intervals of 
lengths T,-^,^,...,2,l. LetX := Zo\j...uZi be the binary laminar family of intervals that 
we obtain by repeatedly partitioning intervals into two equally-sized subintervals. Recall 
that each level Z^ contains 2^ many interval / each one consisting of |/| = ^ many 
time units. 

Our algorithm will schedule the jobs in a recursive manner. The main claim is that 
for any interval I*, Lasserre solution y* and a set of jobs J* with suppf/,}/*) e /*, we can 
schedule almost all jobs from J* within 1* while respecting all precedence constraints. 

We use parameters k := log log(T) where Ci > 0 is a large enough constant that 
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we will choose in Section [H and 6 := ——. To get some intuition for the pa¬ 

rameters, considering e and m as fixed constants, one would have k = 0 (log log n) and 
S = 1/ log(n)®^*'’®^°®”*. Formally, the main technical lemma is the following: 

LemmaG. Fixe > 0. Let I* e X be an interval from the balanced family. Lety* e LASr* iK{T)) 
be a Lasserre solution with 


r* >log2{r*)-2mfc^-2^'/5. 


Let J* Q {j e J: y*j j, = 1} be some jobs fully assigned to I*. Then one can find a feasible 
assignment a:/*\ ^ 1* that discards only 


\l 


discarded^ ~ 2 


logaCr) 


• ir 


2m 


many jobs. 

Before we move on to explain the procedure behind LemmalU we want to argue that 
it implies our main result: 

Proof of Theorem^ We set I* := T} and y* := y, then J* := / is a valid choice as 

trivially = 1 for any joh. To satisfy the requirement of Lemma[6]we need 


log2(r) • < {log(n))0«T)^i'>g2iog2(«)) 

manylevels of the hierarchy. Here we use that A: = 0(Yloglogr),hence2^ = (log(r))®^™^'^' 
and 2^^ = (we want to point out that many of the lower order terms 

are absorbed into the 0-notation of the exponent and we assume that j,m< log 2 (n)). 
Then Lemma[6]returns a valid assignment a: J\ /discarded ^ [/’] that discards only 


£ log2(r) £ 

I/discarded I — 77 ' 7 7^77 ' ^ + 77 '\J\ — £ ’ T 

2 log 2 (r) 2m 


many jobs. Inserting those discarded jobs via Lemma[5]then results in a feasible schedule 
of makespan at most (1 -t £) • T. □ 


The rest of the manuscript will be devoted to proving LemmalU We fix a constant 
£ > 0 as the target value for our approximation ratio and denote T* := |/* | as the length 
of our interval. 

Let us first argue, how to handle the base case, which for us is if log 2 (T*) < . In that 
case, we have at most mT* < m2^ jobs. Hence, the Lasserre lift has r* > mT* many 
levels and one can repeatedly condition on events xj^ = 1 for j e J* and tc I* until one 
arrives at a Lasserre vector y* * with y* t e {0,1} for ail / e J*. This then represents a valid 
schedule of jobs J* in the interval 1* without that any jobs need to be discarded. 
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We now come to a high level description of the algorithm. Let he 

the family of suhintervals of I*, where X*^ contains 2^ intervals of length ^ each (see 
Figure[TJ. For a joh j ^ J*, we define £[j,y*) •= max{^ : 3/ e with ^ = 1} as the level 
that owns the joh in the current Lasserre solution. We also abbreviate J{£, y*) := {j e J* \ 
j’ y*) = all jobs owned by level £. The algorithm is as follows: 

• Step 1: Starting with the Lasserre solution y*, we can iteratively condition on 
events until we arrive at a Lasserre vector y** that has the property that for any 
interval 7 £ Xq u ... uX* 2 _^, the jobs owned by that interval have small depen¬ 
dence degree, that means A(/(7,y**)) < 5|/|. If we then consider the set of jobs 
J** '.= {} ^ J* I 0 < 7'(7,y**) < owned by the first levels, the longest chain in 
]** will contain at most k^5T* jobs. We will show in Section |4] that the number 
of conditionings that are needed, can be upperbounded by 2mk^ -2^^ 15, which 
implies thaty** e LAS^,_ 2 ^j^ 2 . 2 k^/s<^K{T)). 

• Step 2: From now on, we work with the modified Lasserre solution y* *. We select 
a level index £* e{k,...,k^} and partition the jobs in J* in three different groups: 

- The jobs on the top levels: /top :=/(0,y**) u...u/(7'* - A:-l,y**) 

- The jobs on the A: middle levels: /middle := /(^* - A:,y**) U...U J{£* - l,y**) 

- The jobs on the bottom levels: /bottom := /(^*,y**) u...u/(log 2 {T*),y**) 

Then we discard all jobs in /middle- In Section [6] we will describe how the index 
£* is chosen and in particular we will provide an upper bound on the number of 
discarded middle jobs. 

• Step 3: In this step, we will find a schedule for the bottom jobs. For this purpose, we 

call Lemma[6]recursively for each interval I e Xf with a copy of the Lasserre vector 
y** and jobs // := {/ e /bottom I Ty / = II- Then Lemma [6] returns a valid schedule 
of the form U/ : // \ //.discarded ^ I for each interval 7 e X/.. Let /bottom-discarded := 
U/ei^f. //.discarded £ /bottom be the Union of jobs that were discarded in those calls. 
The partial schedules U/ satisfy |o'/{f)| < m for tel and \o~j^{t) \ = 0 for f C 7. We 
combine those schedules to a schedule a : /bottom//bottom-discarded ^ I* ■ From the 
disj ointness of the intervals, it is clear that again | a“ ^ (t) | < m for all te I*. More - 
over, if j < j' and j, j e // for some interval 7 e X/., then by the inductive hypothe¬ 
sis C7{/) On the other hand, if / e Jjandj'e ///then we know by Lemma[3lb 

that 7 had to come before I' since y* j = 1 = yj*f 

• Step 4: We continue working with the previously constructed schedule a that sched¬ 
ules the non-discarded bottom jobs. In this step, we will extend the schedule a and 
insert the jobs of /top in the remaining free slots. We will prove in Section [5] that 
this can be done without changing the position of any scheduled bottom job and 
without violating any precedence constraints. Again, we allow that the procedure 
discards a small number of additional jobs from /top that we will account for later. 
Eventually, the schedule a satisfies the claim for Lemma[6l 
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Figure 1: Binary dissection of the interval I* used in the algorithm behind LemmalU 


The intuition behind the algorithm is as follows: When we call the procedure recur¬ 
sively for intervals / e we cannot control where the jobs // will be scheduled within 
that interval I. In particular the decisions made in different intervals 1,1' e will in 
general not be consistent. But the discarding of the middle jobs has creates a gap be¬ 
tween the top jobs and the bottom jobs in the sense that the intervals of the top jobs are 
at least a factor 2^ longer than intervals of the bottom jobs. For a top job j e /top we will 
be pessimistic and assume that all the bottom jobs that j depends on will be scheduled 
just at the very end of their interval. Still, as those intervals are very short, we will be 
able to argue that the loss in the flexibility is limited and most of the top jobs can be pro¬ 
cessed. As a second crucial ingredient, the conditioning had the implication that the top 
jobs do not contain any long chains any more. This will imply that a greedy schedule of 
the top jobs will leave little idle time, resulting in only few discarded top jobs. 

4 Step (1) — Reducing dependence 

In this section we will implement “Step (1) ’’ and show that we can reduce the maximum 
dependence degrees of the jobs owned by the first levels in order to bound the length 
of chains. We will do this by conditioning on up to 2mk^ -2^ 16 many variables. Recall 
that we are considering an interval/* and a subset of jobs J* £/thatthe current Lasserre 
solution y* fully schedules within I*. For one of the subintervals / £ below /*, let us 
abbreviate := {j e J[£,y*) | y*^ = 1} as the jobs owned by that particular interval. 

Lemma?. Lety* e LASr* (//(T)). Then one can find an induced solution y** cLASr^iKiT)] 
with r** ■= r* -2mk^-2^^ 16 so thatAiJihy**)] <(5-|/| for all intervals I eXg u...uX* 2 _^. 


10 



































Proof. We set initially y** := y*. If there is any interval / = / 1 U /2 e Xq u with 

A{/{/,y**)) >5-|/|,thenwe select the joh 7 e /(/,y**) that (hy symmetry) has ( 7 ) u 

{ 7 }! > I • |/|. Then we replace y** hy the Lasserre solution conditioned on the event 
“X 7 J 2 = 1”. Note that this means that all jobs in u { 7 } will he removed from 

J[I, y**). In fact, each such joh will he moved to JU', y**) where I' e f is some suhinter- 
val. The conditioning can also change the assignment of other jobs, but for each job 7 , 
the set suppCj, y* *) can only shrink if we condition on any event, see Lemma|31b. Hence 
jobs only move from intervals to subintervals. 

Since in each iteration, at least | • |f| ^ f • ^ many jobs “move” and each job moves 
at most many times out of an interval in Xq u... uX* 2 _^, we need to condition at most 

2mT*-k^ P 2*"" 

- Tf ,— = 2mk ■ —— 

many times. 

□ 

The implication of Lemma[7]is that the set of jobs assigned to intervals 7 e Xg u ... u 
X* 2 _^ will not contain long chains, simply because we have only few intervals and none 
of the jobs assigned to intervals contain long chains anymore. 


Lemma 8 . After applying Lemma0 the longest chain within jobs assigned to intervals 
7 eXg u... uX* 2 _^ has length at most k^5T*. 


Proof. First, let us argue how many jobs a chain can have that are all assigned to intervals 
of the same level t. From each interval 7, the chain can only include d|7| = 5 -many 


jobs. Since |X^| = 2^, the total number of jobs from level f is bounded by dP*. The 
claim follows from the pigeonhole principle and the fact that we have k^ many levels in 

□ 


X*u...uX*2_,. 


We can summarize the algorithm from Lemma[7]as follows: 

Breaking long chains_ 

Input: Scheduling instance with jobs /*, a precedence order, a Lasserre solution y* e 
LASr* {K{T)), an interval I* 

Output: A Lasserre solution y * * with maximum chain length k^6T* in U^_q^ Jif,y**) 

(1) Make acopy y** := y* 

(2) WHILE 37eU^^o^X* WITH A{/{7,y**)) > d|7| DO 

(3) Chooseajob 7 e/(7,y**) with | 5 /( 7 ,y..)( 7 ) u { 7 }] > 5|7|. 

(4) If |d|(j^..){ 7 ) u { 7 }! ^ f • |f| THEN condition in y** on xjj^ = 1 
ELSE condition on = 1. 

Note that after each conditioning in step ( 6 ), the vector y** will change and the set 
7(7, y* *) will be updated. 
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5 Step (4) — Scheduling top jobs 


Consider the algorithm from Section |3] and the state at the end of Step 3. At this point, 
we have a schedule a that schedules most of the bottom jobs. The main argument that 
remains to be shown is how to add in the top jobs which are owned by intervals in u 

This is done in two steps. First, we use a matching-based argument to show that most 
top jobs can be inserted in the existing schedule so that the precedence constraints with 
the bottom jobs are respected. In this step, we will be discarding up to -T* many 

jobs. More crucially, the schedule will not have satisfied precedence constraints within 
/top. In a 2nd step, we temporarily remove the top jobs from the schedule and reinsert 
them with a variant of the Earliest Deadline First (EDF) scheduling. As we will see later 
in Theorem[T0l this results in at most • T* additionally discarded jobs. 

5.1 A preliminary assignment of top jobs 

Let us recall what we did so far. In Step 3, we applied Lemma[ 6 ]recursively on each inter¬ 
val / £ Ir- to schedule the bottom jobs. We already argued that the resulting schedules 
could be combined to a schedule cr: /bottom//bottom-discarded ^ J* that respects all prece¬ 
dence constraints. 

Let the intervals in I*, be called /i. . Ip, so that the time horizon T* is partitioned 

into p equally sized subintervals with p = 2^ . After reindexing the time horizon, let us 

assume for the sake of a simpler notation that I* = {1 . T*}. If we abbreviate t; := 

i ■ ^ for i e { 0 ,..., p}, then the ith interval contains the time periods := {/-i -i-1,..., ffi. 
Each time f has an available capacity of cap (t) = m-\a~^[t) \ e many machines, 

which is the number of machines not used by jobs in /bottom- We abbreviate cap!/;) := 
T-tEii cap(f) as the capacity of interval 

The available positions of jobs in /top are constrained by the scheduled times of jobs 
in /bottom- As we had no further control over the exact position of the bottom jobs within 
their intervals It , we want to define for each j oh j e /top a release time rj and a deadline dj 
determined by the most pessimistic outcome of how a could have scheduled the bottom 
jobs. For all j e /mp, we define 

rj := min { h + 1 | a{f) <ti\Jj'e /bottom with f < j} (2) 

dj := max{ti \ a{j') > / + 1 V/' £ /bottom with / < /'} 

In particular, the release time will be the first time unit of an interval /, and the deadline 
will be the last time unit of an interval fi. Let irij) and idij) be the corresponding in¬ 
dices, so that the release time is of the form rj = hv(/)-i +1 and the deadline is dj = ti^{j). 
Then our goal is to show that most top jobs j can be scheduled somewhere in the time 
frame u... u Ii^{j) ■ This would imply that at least all precedence constraints between 

bottom and top jobs are going to be satisfied. 
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Figure 2: Visualization of interval I* = /lU... u/p and possible release times and dead¬ 
lines forajob j e /top. Note that supp( 7 ,y**) might be stretched over tbe whole hatched 
area, while our choice of rj and dj will force us to process j inside the black-hatched 
area (or to discard the job). 


Notice here that the existing fractional assignment that y** provides for j e /top 
might be placed up to one interval outside of the range [rj,dj] (see Figure |2j. This is 
due to our requiring the release time and deadline to correspond to interval beginnings 
and endings. Let := /bottom \ /bottom, discarded be the majority of bottom jobs that 

our schedule processes. 

Lemma 9. A valid schedule a : ' I* of bottom jobs can be extended to a schedule 

^ • ^bottom ^top J'top - Jtop that includes most of the top jobs. The schedule 

satisfies (i) |(7“Hf)l ^ m for t e I*; (ii) rj < aj < dj for all j e and (Hi) one discards at 
most \Jtop \ /topi -T* many top jobs. 

Proof. We want to use a matching-based argument. For this sake, we consider the bi¬ 
partite graph with jobs on one side and subintervals on the other. Formally, we define 
a graph G = [V, U, E'^) with V = /mp, U = {I,..., p} where the nodes i e U have capacity 
cap(7i), and edges 

eVx i/I maxfi'jTj)- 1,1} < i<mm{id{j] + l,p}}. 

Then the neighborhood of each top job includes every interval in which it is fraction¬ 
ally scheduled in y**. Moreover, each of the bottom jobs j e has been assigned 

by a to precisely that interval f with y** = 1. Hence y** gives a V-perfect fractional 
matching that respects the given capacities cap(/i). In bipartite graphs, the existence of 
a fractional V-perfect matching implies the existence of an infegraZ V-perfect matching, 
see e.g. lSch03l . 

However, in order to assign the top jobs to slots within release times and deadlines 
we are only allowed to use the smaller set of edges 

E = {[j, i)eVxU\ irij) < i < idf- 

For any J* e y, we let AZ'*' (/*) be the neighborhood of J* along edges in E"^ and let NiJ*) 
be its neighborhood along edges in E. Let the magnitude of a neighborhood |AZ(/*)| be 
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defined as the sum of capacities of the nodes it contains. By Hall’s Theorem |Sch03l , the 
minimum number of exposed V -nodes in a maximum matching in E is precisely 

max{|/|-|Af(/)|}. 

JQV 

Now, fix the set J* qV attaining the maximum; then |/* | - |Ar{/*)| is the number of jobs 
that we have to discard. Since E'^ allows for a ^-perfect matching, the reverse direction 
of Hall’s Theorem gives that |/* | < | N+ {/*) |. Thus |/* | -1 N[J *) | < | Ar+ (/*) | -1 N{J* ] |. Note 
that N(J*] is in general not a consecutive interval of {1,..., p}. We can upper bound the 
difference | N'^ (/*) I - I N(/*) | by 2 m • times the number of connected components of 
This is the point where we take advantage of the “gap” between the levels of the 
top and bottom jobs; for each job j eV there is an interval I eZqU... so that 

W" ij) contains the midpoint of that interval. Due to the gap, there are at most p-2 ■2~^ 
such interval^ Hence the number of discarded jobs can be bounded by 

\J*\-\NiJ*)\<2m - 2p-2~’^ = 4m-2~’^-T*. 

P 

Finally note that a corresponding maximum matching can be computed in polynomial 
time. □ 


5.2 Reassigning the top jobs via EDF 

We have seen so far that we can schedule most of the bottom and top jobs so that all 
precedence constraints within the bottom jobs are satisfied and the top jobs are correctly 
scheduled between the bottom jobs that they depend on. However, the schedule as it 
is now ignored the precedence constraints within the top jobs. In this section, we will 
remove the top jobs from the schedule and then reinsert them using a variant of the 
Earliest Deadline First (EDF) scheduling policy. 

For the remainder of Section ISlZl we will show a stand-alone theorem that we will 
use as a black box. Imagine a general setting where we have m identical machines and 
n jobs /, each one with integer release times rj and deadlines dj and a unit processing 
time. The EDF scheduling rule picks at any time the available job with minimal dj for 
processing. It is a classical result in scheduling theory by Dertouzos IDer74l that for 
m = 1 and unit size jobs, EDF is an optimal policy. Here “optimal” means that if there 
is any schedule that finishes all jobs before their deadline, then EDF does so, too. The 
result extends to the case of arbitrary running times pj if one allows preemption. 

Now, our setting is a little bit different. For each time t, we have a certain number 
cap{f) £ {0,..., m} of slots. Additionally, we have a precedence order that we need to re¬ 
spect. But we can use to our advantage that the precedence order has only short chains; 
moreover, the number of different release times and deadlines is limited. 

Formally we will prove the following: 

^It is possible that N{j) = 0. StiU wiU contain a midpoint of a level 0,... * - fc - 1 interval, hence 

we have accounted for those jobs as well. Note that such jobs would automatically get discarded. 


14 







Theorem 10. Let J be a set of jobs with release times rj, deadlines dj and consistent 
precedence constraintil with maximum chain length C. Suppose that { 1 ,..., T} is the 

time horizon, partitioned into equally sized intervals h . Ip and all release times/deadlines 

correspond to beginnings/endings of those intervals. Let cap : [T] ^ be a ca¬ 

pacity function and assume that there exists a schedule (T : / — [T] assigning each job to 
slots between its release time and deadline that respects capacities but does not neces¬ 
sarily respect precedence constraints within J. 

Then in polynomial time, one can find a schedule a: J\ /discarded [ T’] that respects 
capacities, release times, deadlines and precedence constraints and discards \Jdiscarded\ ^ 
pmC many jobs. 

We use the following algorithm, which is a variant of Earliest Deadline First, where 
we discard those jobs that we cannot process in time: 

Earliest Deadline First_ 

Input: Jobs J with deadlines, release times, precedence constraints; 
capacity function cap :{T]^{0,...,m} 

Output: Schedule a: J lT]u {DISCARDED} 

(1) Set o{j)\= UNASSIGNED for all j ej and /discarded := 0 

(2) Sort the jobs / = |1,..., n} so that di<d 2 <...<dn 

(3) FORf=lTOrDO 

(4) FOR cap (t) MANY TIMES DO 

(5) Select the lowest index j of a job with rj < t< dj that has not been sched¬ 
uled or discarded and that has all jobs in 5~ ( j) already processed (or dis¬ 
carded). 

(6) Set a( j) := t (if there was any such job) 

(7) FOR each j e J with dj = t and o{j) = UNASSIGNED, add j to /discarded and set 
aij] := DISCARDED 

At the end all jobs j will be either scheduled between rj and dj (that means rj < a(_j] < 
dj] or they are in /discarded- 

We will say that a job j was discarded in the interval [t, t'] if j e /discarded and t<dj< 
t'. We call a time t busyii |ff“Hf)| = cap(f) and non-busy otherwise. Fetus make a useful 
observation: 

Lemma 11. Let I = {t',..., t"} e be part of one of the subintervals. Suppose that there 
is a non-busy time t* el and a job j with IQ [rj,..., dj] and aij] e {DISCARDED} 
l,...,T}. Then there is a job j* e a~^it*] with j* < j. 

Proof. Consider any inclusion-wise maximal chain of jobs /i < jz < ... < jq that ends 
in j = jq and otherwise has only jobs ji,...,jq-i e a~^[{t*,..., t"}]. First suppose that 
<7 > 1 and hence ji ^ j. it is impossible that aiji] > t* because by assumption rj^ < t* 

^Here “consistent” means that for a pair of dependent jobs j < j' one has rj < rji and dj < dji. 
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and hence EDF would have processed ji already earlier at time t* (hy maximality of the 
chain, there is no joh scheduled at times t*- I on which j\ might depend). 
Hence crCji) = t* and hy transitivity ji < j, which gives the claim. 

In the 2nd case, we have ji = j, hence there is no joh that j depends on scheduled 
between t* and t". But we know that rj < t*. Thus EDF would have processed j at time 
t* or earlier. □ 

With this observation we can easily limit the number of non-busy times per interval: 

Lemma 12. Let I = {t',..., t''} e be part of one of the subintervals. Suppose that there 
is at least one job j with IQ {rj,...,dj} andaij) e {DISCARDED} u {t” + l,...,T}. Then the 
number of non-busy times in I is bounded byC. 

Proof. By LemmafTTl for the latest time t* e I with |c 7 {t*)| < cap{t*), there is at least one 
job j* with j* < j. Then we can continue by induction, replacing t" with t* -1 

and replacing j by j * to build a chain of jobs ending with j that includes a job scheduled 
at each non-busy time. Since no chains can be longer than C, this gives the claim. □ 

Now we come to the main argument where we give an upper bound of the number 
of discarded jobs: 

Lemma 13. One has \Jdiscarded\ ^ pmC. 

Proof. Suppose that l/discardedi > p-K;we will then derive a bound on K. By the pigeon¬ 
hole principle, we can find an interval 1^ so that at least K many jobs get discarded in 7^. 
Let us denote the lowest priority (i.e. the latest deadline) job that gets discarded in /j, by 
js. Now delete all those lower priority jobs js+i,..., jn- Note that this does not affect how 
EDF schedules ji,..., js and still we would have at least K jobs discarded in 7^, including 
js. By LemmafT^ the number of non-busy periods in 7^ is bounded by C. Now, choose a 
minimal index a e {1,..., fi -1} so that in each of the intervals la,..., It one has at most C 
many non-busy periods. We abbreviate 7' := 7^ u... u 7i,. Note that by definition la-i has 
more than C many non-busy period^. Define 

/ := {; £ {ji,..., jsl: (a( 7 ) e 7') or (j discarded and dj e 7')}. 

By Lemma[TTJ any job in / has its release time in la or later, since otherwise we could 
not have C-i-1 non-busy times in la-\- Now, let us double count the number of jobs in 
/. On the one hand, we have 

b b 

1/ I — ^ | 0 ' (7;)I + \J n/discardedI — ^ (cap(7;) — mC) + 1/ H/discarded[ 

i=a i=a ' 

>K 

> capil')-pmC-rK. 

^Admittedly it is possible that a = 1 in which case one might imagine /q as an interval in which all times 
are non-busy and which does not contain any release times. 
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On the other hand, we know that there is an assignment a of jobs in / to slots in 7^,..., Ijj. 
That tells us that |/| < cap(7'). Comparing both bounds gives that K < pmC. □ 


6 Step (2) — Accounting for discarded jobs 


In this section, we need to argue that the level (* can be chosen so that the total number 
of jobs that are discarded in Steps (l)-{4) are bounded by 


\]* I ^ g log 2 (l^ -17*1 I ^ - \J* 
l^discardedl - 2 log 2 {r) 2 m ^ 


as claimed. Let us summarize the three occasions in the algorithm where a job might get 
discarded: 


(A) In Step (3), in order to schedule the bottom jobs, we have 2^ many recursive calls 
of Lemma[ 6 ]for intervals 7 e . The cumulative number of discarded jobs from 
all those calls is bounded by 


e ^ogzi^) T* 

T log2(r) 


+ „ ’I /bottom I — „ 
2 m 2 


£ iog2(r)-r 


iog 2 (/’) 


// + „ ■ l/bottomi 

2 m 


(B) As we have seen in Section[5l the number of top jobs that need to be discarded in 
Step (4) can be bounded by 

4m • 2“^ • r* + pmC < 4m • 2“^ • T* + T*5<- - ^ - T*. 

4 log2(r) 

Here we use that the length of the maximum chain within the top jobs is C < 
k^5T*. Moreover, we have substituted the parameters p = 2^ <2^ as well as 
^ " 8 fc^m 2 ^^^iog,(r) ^ f log 2 log 2 ( T] with a large enough constant Ci > 0. 

(C) In Step (2), we discard all the middle jobs. In the remainder of this section we prove 
that there is an index £* so that 


I/middle I — 


4 iog2(r) 


„ ■ (l/middlel + l/topi) 

2 m 


Let us first assume that we can indeed find a proper index £* so that the bound in (C) 
is justified. Then the total number of jobs that the algorithm discards is 


(A) 


(B) 


(C) 


£ iog2(r)-r 

2 


log 2 (/) 


■T* + - 


2m 


■ I/bottom I + 


4 iog2(r) 


' T + ' , ^ o ' l/topD 

4 log 2 (T) 2m 


£ l 0 g(r*) £ 

“ ' ■; 77^ + 7, ■ (l/topi + l/middlel + l/bottomi) 

2 log 2 (r) 2 m ^ 1 .-. 

=17*1 
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which is the hound that we claimed in LemmalU 

It remains to justify the claim in (C). We abbreviate at := \J{i • fc, y**) u... u Jiii + 1) • 
fc- l,y**)| for i e {0,., fc- 1}. In words, each number at represents the number of jobs 
owned by k consecutive levels. We observe that if there is an index i £ {0,..., fc -1} so that 


(/) ai< 


4log2(r) 




or 


ill] — 
2m 


aj 


;=i 


then we can choose (* := ii + 1 ) • A: and (C) will be satisfied. Here we use that for this 
particular choice of^*, we will have l/middlel = oci and |/top| = ao + --- + «i-i- 

So, we assume for the sake of contradiction that no index i satisfies either {/) or [11] 
(or both). Then one can easily show that the ai’s have to grow exponentially. We show 
this in a small lemma; 


Lemma 14. Let qeN and suppose we have a sequence of numbers ao, ai,..., nw satisfy¬ 
ing Ui > amin > 0 and ai > • Ll-i ct / for alli = 0,..., N. Then a, > • amin- 

6 / J — L J 


Proof. Group the indices into consecutive blocks of 2q numbers, where ap,..., a 2 q-i is 
block 0, azq, ■■■, a^q-^ is block 1 and so on. We want to prove by induction that each a, 
in the jth block is at least 2^ • amin- For j = 0, the claim follows from the assumption. For 
j > 0 , we use that a, is at least the sum of 2q numbers that by inductive hypothesis are 
all at least 2^~^ ■ amin- The claim follows. □ 

Applying LemmafTTlwith amin := nog^m ' q:=^we obtain in particular that 


afc_i - 


4iog2(r) 


If we set k = log 2 log 2 T for some adequately large Ci, then a^-i > mT*, which is a 
contradiction since we only have \J*\ < m-\I*\ = mT* many jobs with yy/» = 1. 


7 Conclusion 

For the proof of Lemma [ 6 ] we already argued that the number of discarded jobs sat¬ 
isfies the claimed bound and that all precedence constraints will be satisfied. What 
concerns the number of rounds in the hierarchy, recall that we started with a vector 
y* £ LASr* (f^(T)) with r* > log 2 (r*) • 2mA:^ • 2^^/5. Then we apply a round of condi¬ 
tionings in Lemma[7]to obtain y** £ Las,-(.A f(T)) with r** := r* -2mk^ -2^^ 15. We 
use copies of the vector y* * in our recursive application of Lemma [6] to intervals of 
size T* 12^ . Since > A: > 1, the remaining number of Lasserre rounds satisfies r** > 
log 2 (r* !£*] ■2mk^ -2^ 15. Thus we still have enough Lasserre rounds for the recursion. 

Another remark concerns why we may assume that the precedence constraints are 
consistent in Theorem fTOI Suppose we have jobs j,j' e /mp with j < j' and consider 
the definition of release times and deadlines in Eq.|2] By transitivity, any job j” £ /bottom 
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with / < j" which limits the deadline of / will also limit the deadline of j, hence dj < dji. 
Similarly one can argue that rj < rj’. 

This concludes the proof of Lemma[6]and our main result follows. 
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